Remove `compressed` flag from all mmseqs modules #7211

jasmezz · 2024-12-12T16:40:50Z

We've been struggling a long time with mmseqs modules which wouldn't run with the GTDB database (in the nf-core/funcscan pipeline in our case). The reproducible error occurred always late in the "prefilter" step of the mmseqs runs with infos like aggregatetaxweights died.

@Darcy220606 recently solved the mystery:

[The flag] --compressed 1 is hardcoded across all mmseqs module on nf-core, that apparently is creating a problem when no contigs pass the prefilter step at a certain kmer size, which is a bit downstream in the general prefilter process (esp. that we have a relatively strict parameters). This confuses the tool as the files are 'emptily compressd'.

Hence, we ran all modules with the --compressed 1 removed and now they run through successfully even with the big GTDB database.
Taking this as confirmation, the flag is removed in all mmseqs modules in this PR. The flag can then be re-added as a parameter on pipeline level if needed.

PR checklist

Closes #XXX

famosab · 2024-12-13T14:10:39Z

Seems like some of the snapshots need to be updated for the tests to pass :)

Remove compressed flags from all mmmseqs modules

b886a8b

maxulysse approved these changes Dec 12, 2024

View reviewed changes

vagkaratzas approved these changes Dec 12, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove `compressed` flag from all mmseqs modules #7211

Remove `compressed` flag from all mmseqs modules #7211

jasmezz commented Dec 12, 2024 •

edited

Loading

famosab commented Dec 13, 2024

Remove compressed flag from all mmseqs modules #7211

Are you sure you want to change the base?

Remove compressed flag from all mmseqs modules #7211

Conversation

jasmezz commented Dec 12, 2024 • edited Loading

PR checklist

famosab commented Dec 13, 2024

Remove `compressed` flag from all mmseqs modules #7211

Remove `compressed` flag from all mmseqs modules #7211

jasmezz commented Dec 12, 2024 •

edited

Loading